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Abstract 

Background: Online tools and databases based on multi-allelic short tandem repeat polymorphisms (STRPs) are 
actively used in forensic teaching, research, and investigations. The Fst value of each CODIS marker tends to be low 
across the populations of the world and most populations typically have all the common STRP alleles present 
diminishing the ability of these systems to discriminate ethnicity. Recently, considerable research is being 
conducted on single nucleotide polymorphisms (SNPs) to be considered for human identification and description. 
However, online tools and databases that can be used for forensic research and investigation are limited. 

Methods: The back end DBMS (Database Management System) for FROG-kb is Oracle version 10. The front end is 
implemented with specific code using technologies such as Java, Java Servlet, JSP, JQuery, and GoogleCharts. 

Results: We present an open access web application, FROG-kb (Forensic Research/Reference on Genetics- 
knowledge base, http://frog.med.yale.edu), that is useful for teaching and research relevant to forensics and can 
serve as a tool facilitating forensic practice. The underlying data for FROG-kb are provided by the already 
extensively used and referenced ALIele FREquency Database, ALFRED (http://alfred.med.yale.edu). In addition to 
displaying data in an organized manner, computational tools that use the underlying allele frequencies with user- 
provided data are implemented in FROG-kb. These tools are organized by the different published SNP/marker 
panels available. This web tool currently has implemented general functions possible for two types of SNP panels, 
individual identification and ancestry inference, and a prediction function specific to a phenotype informative panel 
for eye color. 

Conclusion: The current online version of FROG-kb already provides new and useful functionality. We expect 
FROG-kb to grow and expand in capabilities and welcome input from the forensic community in identifying 
datasets and functionalities that will be most helpful and useful. Thus, the structure and functionality of FROG-kb 
will be revised in an ongoing process of improvement. This paper describes the state as of early June 2012. 

Keywords: Knowledge base, SNP, InDels, Forensics, Individual identification, Ancestry inference markers, Lineage 
informative markers 



Background Europe consist of eight loci that overlap with CODIS 

It is considerably more than a decade since the forensic plus seven additional markers that include the five new 

community settled on a set of short tandem repeat European Standard Set (ESS) [2], Discussions on the best 

(STR) polymorphisms (hence also STRP) for human options on expanding the core sets of loci are underway 

identity testing [1]. These markers are multi-allelic and [3-5]. Development of reliable commercial multiplex kits 

are excellent for individual matching of suspect and tailored specifically for these sets of markers has led to 

crime scene DNA. While 13 STRs form the core of large offender databases and large amounts of allele fre- 

the FBI Laboratory's CODIS (Combined DNA Index quency data accumulated for these markers on a wide 

System), the 10 core loci used in the UK and much of range of populations around the world. Online tools and 

databases have followed to allow users to reference and 
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Calculator [6] (http://www.csfs.ca/pplus/profiler.htm), 
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European Network of Forensic Science Institute's DNA WG 
STR Population Database (http://www.str-base.org/index. 
php), STRBase (http://www.cstl.nist.gov/strbase/) [7], Pop- 
Affiliator (http://cracs.fc.up.pt/popaffiliator/) [8], and pop. 
STR (http://spsmart.cesga.es/popstr.php) [9]. One argument 
for continuing use of these tools and marker panels is the 
large number of individual offender DNA profiles in data- 
bases allowing cold hits,' that is, identification of the criminal 
based on a database match to crime scene DNA. The exten- 
sive allele frequency data that have been accumulated over 
the years in large public databases also allow population- 
specific estimates of the probability of a random match of 
two unrelated individuals. However, it is exactly the high 
level of polymorphism in almost all populations that limits 
the ability of these markers to determine ancestry of an indi- 
vidual. The large numbers of alleles and high heterozygosity 
relate to the high mutation rates of these loci; this also 
means that matching of STRP alleles is matching by state 
and not of alleles that are identical by descent. 

Considering the ease, accuracy, and efficiency in typing 
single nucleotide polymorphisms (SNPs) and their essen- 
tially zero rate of recurrent mutation when compared with 
STRPs [10-12], SNPs have the potential to be considered 
for human identification and description in forensic, 
biomedical, association, as well as epidemiological studies 
[13-15]. Insertion-deletion polymorphisms (InDels) have 
most of the desirable characteristics of SNPs and panels 
have been proposed and have begun to be used in forensics 
both for individual identification and for ancestry inference 
[16-18]. However, considerable research is required to es- 
tablish a reliable set of markers (SNPs or InDels) containing 
sufficient numbers of markers to provide excellent discrim- 
inatory power comparable to or exceeding that of STR mar- 
kers. Not only do multiple populations need to be studied 
to identify the best markers but interpretation of results in 
any application needs the reference allele frequencies in 
multiple populations. The discriminatory power for individ- 
ual identification will be population specific and ancestry 
inference will only be as good as the set of reference popu- 
lations. Online web tools that demonstrate classification 
algorithms are being developed for SNP sets as well, 'The 
Snipper' app suite (http://mathgene.usc.es/snipper/) for 
three ancestry informative AISNP sets (34, 32, and 77 mar- 
kers) being one of them [19]. Currently, the main limita- 
tions of these tools are the number of SNP sets and the 
range of population data available for computation. 

In this paper we introduce FROG-kb (Forensic Re- 
source/Reference on Genetics knowledge base), an open 
access web tool that allows viewing and retrieval of data 
as well as calculation of statistics on several forensically 
relevant SNP sets. FROG-kb s user interface is versatile 
in its functionality and comprehensive in the population 
data available for many SNP sets. The overall goal of 
FROG-kb is to make allele frequency data for SNPs and 



other genetic polymorphisms more accessible and useful 
in a forensic setting. Ancestry, Individual, and Phenotype 
Informative (AISNP, IISNP, and PISNP) panels [19-24] 
studied and published from the host lab and elsewhere 
are currently available in FROG-kb. Each of these panels 
exists with supporting population data. URL links exist 
to ALFRED for more details and allele frequency data 
tables for specific populations not only for the panels 
themselves but also for each SNP in each panel. Add- 
itional information in the underlying ALFRED database 
and the curated links into other databases make FROG- 
kb a reference source as well. As frequency data on a 
population not in the original publication become com- 
prehensively available for the full SNP panel, that popu- 
lation is included in the computations. As new 
forensically relevant panels of SNPs with meaningful 
population data are published, they will be systematically 
added to FROG-kb. The structure and types of contents 
in FROG-kb are described below followed by descrip- 
tions of current functionality with examples. 

Methods 

Database structure 

The underlying data for the FROG-kb implementation are 
from the allele frequency database ALFRED (http://alfred. 
med.yale.edu) [25,26]. ALFRED is a relational database 
and aspects of the structure and relationships of the tables 
exist in the above mentioned publications. Additional 
tables accommodate information essential for a human 
identity testing application. Figure 1 gives the database 
tables and relationships incorporating only the supplemen- 
tary tables relevant to FROG-kb. The logic supporting the 
design of the additional tables and relationships follows. 
(FORENSIC_PANEL). Every panel is linked to at least one 
publication (PUB_FORENSIC_PANEL). Such links are 
clearly identified when the underlying data are unpublished 
to document the source of the data. The marker phenotype 
(equivalent to 'genotype' based on multiple unavoidable 
assumptions) frequency for each site - population sample' 
combination is pre-calculated and saved in 'FORENSI- 
CPHENOFREQ'. Since these population-marker frequen- 
cies do not change for existing data, pre-calculation of the 
phenotype summary is reasonable and expedites the 
involved case-specific computations. While all allele fre- 
quency data required for running the computations in 
FROG-kb were already in the ALFRED database, the new 
tables provide the framework for displaying information 
related to the different forensically relevant SNP sets in an 
efficient and user-friendly manner. 

Implementation 

The database for FROG-kb is implemented using Oracle 
version 10 on one of Yale's institutional database servers 
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Figure 1 Additional tables added to ALFRED for FROG-kb. The full relational schema of ALFRED is described in [26]; these tables and their 
interconnections illustrate the additions needed for the functionality of FROG-kb. 



where it is maintained. The web front end is built using 
web developing technologies such as Java, Java Servlet, 
JSP, JQuery, and GoogleCharts. Almost all of the client- 
code utilizes JQuery, and the server implementation is in 
Java. The currently deployed version of FROG-kb has 
been tested on both PC and Mac, using many different 
browsers: Mozilla Firefoxll.O, Internet Explorer 8.0, 
Safari 5.1.4, and Google Chrome 17.0 on PC, and Fire- 
foxll.O, Internet Explorer 5.0, Safari 5.0 on Mac. We are 
using Tomcat as our web server which runs on a Win- 
dows XP machine. 

Functionalities 

The user interface layout of FROG-kb is designed to re- 
flect the organization of the contents and functionality, 
as well as ease of use. Every set of pages relative to a 
function on FROG-kb originates from a tab on the 'Main 
Menu' (Figure 2A) that appears on the left-hand side of 
every page. The 'Home Page' gives a brief summary of 
the functions available in FROG-kb and what can be 
expected soon. The procession through the interface, 
explained below, is summarized graphically under 
'Pipeline'. 

Users can navigate into the functions relevant to each 
type of SNP panel by selecting the appropriate tab: 
1ISNP' for Individual Identification SNPs, 'AISNP' for 
Ancestry Inference SNPs and 'PISNP' for Phenotype In- 
formative SNPs. Following selection of a particular SNP 



panel category (IISNP, AISNP, or PISNP), there are mul- 
tiple published panels listed. The citation information 
for each of the panels, a 'Go' button to navigate into the 
selected panel, and a 'Detailed Overview of SNPs' link to 
navigate into ALFRED are provided for each (Figure 2B). 
The link to ALFRED opens the 'SNP Sets' page within 
ALFRED into a new browser window. The SNP Set 
module in ALFRED has multiple functions, including 
the ability to see for each SNP a pie chart on Google 
Maps of frequencies for all populations with data. 

Several options are possible after entering a SNP panel 
page. The functions related to the selected panel are avail- 
able by selecting the appropriate buttons at the top of the 
page (Figure 2C). The option SNP Set provides the list of 
SNPs in the panel. The list includes the dbSNP rs -number 
of each SNP with an active link to the corresponding 
dbSNP record for molecular characterization of the SNP. 
The Populations button provides the list of populations for 
which comparable calculations can be made. This is the set 
of populations for which all SNPs in the set have allele fre- 
quency data. Conversely, many populations have data on 
additional SNPs; those SNPs are not included for the calcu- 
lations. Within the SNP Set functionality in ALFRED add- 
itional populations may have data for some, but not all 
SNPs; those populations are not included in the calcula- 
tions. Each population name within FROG-kb is an active 
link to information on the population stored within 
ALFRED; that page will open in a new browser window. 
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Figure 2 Snapshot of functionalities on FROG-kb. Portions of relevant screens seen in FROG-kb. (a) The basic navigation panel seen on the 
left side of all screens, (b) An example of one of the specific sets of SNPs, in this case an IISNP panel, with the [Go] button to enter the page with 
options for that set. Also, on the right is the link to ALFRED to view details on all the SNPs in this specific panel, (c) The upper row of buttons for 
options for a specific panel. Other panels may have a slightly different set and a second row of buttons usually has additional options. However, 
the list of the SNPs, the list of the populations, the data entry, clearing the data entry fields, and at least one example dataset are common 
buttons to all panels, (d) The top portion of the data entry page showing the radio buttons to be used to enter the genotype for each SNP. (e) 
The buttons at the bottom of the data entry screen allowing the option of autofilling for missing data and printing of the input data plus the 
[Compile] button to initiate calculation. 



The geographic region of each population is included. A 
world map in which regions are divided on an arbitrary 
but convenient basis is available from the link 'Geo- 
graphic Region Map'. Example options are also access- 
ible using Ex 1 or similar buttons. These are static 
screen shots to provide examples. 

The most significant interactive function is accessed via 
Data Entry (Figure 2D) that opens the ability to specify an 
individuals multi-site genotype (strictly, phenotype) and 
then calculate the probability of that genotype in each of 
the populations. The list of SNPs is sorted by rs-numbers 
for ease of working with the set. For each SNP on the list 
the ALFRED UID, dbSNP rs -number, chromosome, and 
chromosomal position are displayed. The ALFRED UID 
and rs -number are URL links to ALFRED and to dbSNP 
SNP information pages, respectively. This is followed by 
radio buttons for the possible genotypes. The genotype is 
entered by simply clicking on the radio button for the geno- 
type at each SNP. An obvious assumption is that there is 



no allele drop out, that is, that a typing result (phenotype) 
with only one allele detected is really a homozygote. A radio 
button labeled 'NNT is provided for missing data for each 
SNP, but it is not necessary to click on the 'NNP for missing 
data. For large SNP sets, if the user s SNP set is also ordered 
by rs-number in a spreadsheet, selecting the appropriate 
genotype radio-button should be relatively effortless. 
(We are aware this input can be tedious; more user friendly 
options are under development.) At the bottom of the list 
are three buttons (Figure 2E): Set all unselected to un- 
known, Print Format, and Compile. The Print Format will 
generate a condensed version of the input data that can be 
printed as a permanent record of the input data. The infor- 
mation in the pop-up window can also be copied and 
pasted into a text editor which in turn can be opened in an 
Excel spreadsheet. The Compile will initiate calculation 
and display the results. If there are SNPs with no selection, 
a warning will be sent with the missing data rows high- 
lighted for easy detection and the option exists to examine 
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which SNPs have no entry and to either enter a genotype 
or use the Set all unselected to unknown to fill those with 
'NN\ Afterward, it is necessary to click on Compile again. 

The calculations are essentially identical for all the 
IISNP and AISNP panels, just different loci (SNPs) and 
different interpretations are involved. All SNPs are consid- 
ered statistically independent at the population level and 
so the probability of the input multisite genotype is simply 
the product of the probabilities of the genotypes of the in- 
dividual loci— the product rule' in forensics. This calcula- 
tion is done separately for each population using the allele 
frequencies estimated for that population and assuming 
Hardy- Weinberg ratios to estimate the population-specific 
genotype probabilities. The calculation uses all loci in the 
panel for which a genotype was entered. A missing geno- 
type, NN in the input, means that locus is skipped for all 
populations. Only populations with data for all SNPs in 
the panel are considered in the calculation. Thus, the 
resulting probabilities/likelihoods that are displayed are 
based on the same set of loci for all populations. 

Special consideration is needed for those situations in 
which one allele is not observed in a population and hence 
the observed allele frequency is zero and two genotypes 
are strictly estimated to be zero. This is especially import- 
ant for AISNPs since many of the loci are fixed in some 
populations, but even a SNP in an IISNP panel may not 
be seen in an isolated population. Given the sample sizes 
involved, very low frequencies of the missing' allele cannot 
be excluded and using a value of zero in the calculations 
would be incorrect. The approach used is to simply use a 
very small allele frequency instead of the zero that is the 
allele frequency present in ALFRED. If one assumes a het- 
erozygote might be seen in the next individual sampled 
from the population, the allele frequency would then be 1/ 
(2n + 2) where n is the original sample size in which the 
allele was not seen. Other approaches to the problem can 
exist. Because of different finite sample sizes and the un- 
avoidable possibility of typing error, an absolute small fre- 
quency could be used. Another alternative is to simply not 
include that locus in any calculations involving a popula- 
tion in which one allele is not seen in the reference sam- 
ple. This last completely circumvents a fixed locus having 
an undue influence on population rankings for AISNPs, 
but also overcompensates when those alleles are seen in 
the focus genotype. Those and other options can be con- 
sidered by the community for future implementation as 
alternatives. 

The time required to compute the probability of the 
genotype in each population depends on the number of 
SNPs in the panel and the number of populations for 
which the calculations are performed. The results page has 
a table with the probability values against each population 
name and a graph of log 10 (Probability of Genotype) dis- 
played side-by-side. The number of SNPs used in the 



computation is given. The button View SNPs Used gives 
the list of SNPs the computation was based on. Print Table 
Format generates a printable format of the result. The line 
graph of log 10 (Probability of Genotype) is drawn utilizing 
the Google-Chart tools. The population name and the cor- 
responding value are displayed when the mouse is hovered 
over a point (Figure 3). The geographic region displayed 
adjacent to a population name can be verified using the 
image at Geographic Region Map. 

There are also buttons with population names, for ex- 
ample Hungarian or Korean that will open a pre-entered 
data entry page for one individual from the specified 
population. These can be used in an education mode to 
explore the dependence of the results on particular SNPs 
and on the total number of SNPs with data. 

The above mentioned functionalities are common to 
all of the IISNP and AISNP panels. The interpretations 
of the values calculated differ between the IISNP and 
AISNP panels (see below). We note that, since the calcu- 
lations are based on only the SNPs with specified geno- 
types, it is not necessary that an entire panel be 
genotyped. Indeed, if by happenstance a user has geno- 
typed SNPs that are in a panel, those SNPs will yield a 
valid result though there may be little discrimination if 
only a few SNPs are used. 

The single PISNP panel differs in that calculations are 
not dependent on population but only on the genotype 
entered. The Irisplex panel for eye color prediction 
under PISNP has the SNP Set, Data Entry and four pre- 
entered data entry pages. The eye color prediction com- 
putation uses the formula provided in the publication 
[21] and is given on the web site. 

Results and Discussion 

Interpretation of the calculations 

For both the IISNP and AISNP input genotypes the pro- 
grams currently calculate for each specific population 
the probability of that multi-locus genotype as the sim- 
ple product across loci of the frequencies of each specific 
genotype in the specific population. That simple product 
is the one displayed for the population in the rank- 
ordered list and in the graph. The interpretation is dif- 
ferent for the two types of SNP panels. 

For IISNP panels the 'Probability of Genotype' can be 
interpreted as the match probability, that is, the prob- 
ability of finding another unrelated individual in the 
population with the same multilocus genotype. One as- 
sumption is infinite population size with no adjustment 
for finite size or the statistical uncertainty of the allele 
frequency estimates for each population other than the 
zero allele frequency correction noted above. Another 
assumption is of statistically independent loci, that is, no 
linkage disequilibrium. This second assumption has been 
tested, and satisfied, by analyses in the studies that 
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Figure 3 Graph of log 10 (Probability of Genotype). An example of the plot of results for one individual input genotype with the populations 
ordered by the log 10 probabilities from largest to smallest. This is part of the results display common to all MSN P and AISNP panel calculations. 
One can mouse over the graph to see the specific population and its value, as illustrated for Portuguese in this example. Not shown here is the 
list of those populations with the values also displayed on the same page. The option exists to print the list. 



developed and published the panels. The population- 
specific values are given in the table ordered from high- 
est to lowest probability, but is easier to visualize in the 
figure as the orders of magnitude showing how sensitive 
the match probability estimates are to the population 
from which the DNA sample was obtained. Of particular 
note is that the top population gives the highest match 
probability globally, to the extent the populations repre- 
sent the global variation. Note that these values are ac- 
ceptable estimates for whatever data were used. If there 
were many loci with missing data, the probabilities will 
be less definitive but will still be the best estimates for 
the loci used. 

For AISNP panels the same calculation is made, simple 
multiplication of the probability of the population-specific 
genotypes at the several loci. For AISNPs, however, the in- 
terpretation of the probability of the genotype given a popu- 
lation can be considered as proportional to the likelihood of 
the population given the genotype. As likelihoods, the abso- 
lute values have no strict interpretation, only the relative 
values are interpretable. Thus, the rank orders are meaning- 
ful with populations having larger probabilities being more 
likely origins of the individual DNA profile than popula- 
tions with smaller probabilities. The important question 
with respect to ancestry inference is how different the likeli- 
hoods are. One rough rule of thumb that could be applied 
is that likelihoods different by less than an order of magni- 
tude are not significantly different. Thus, while the popula- 
tion with the highest likelihood is the most likely origin of 
the input genotype, it is not necessarily the correct origin 
and those with similar, albeit lower likelihoods, cannot be 



excluded. Indeed, even those more than an order of magni- 
tude less likely are not excluded,' just much less likely. 

The likelihoods calculated are the maximum likelihood 
estimates because they are based on the maximum likeli- 
hood estimates for allele frequencies for independent loci 
with the necessary assumption of Hardy- Weinberg pro- 
portions for the genotypes, albeit with a correction for 
sites with a zero allele frequency. However, those allele 
frequency estimates have associated standard errors that 
vary inversely as the sample sizes. Thus, there are greater 
uncertainties for likelihoods calculated for populations 
with small sample sizes. Hence, rankings among popula- 
tions with very similar likelihoods could be different when 
one or more is based on a very small sample size. The stat- 
istical issues with determining significance of an estimated 
rank order of populations, taking into account all the indi- 
vidual components of uncertainty, are not simple and we 
currently have no rigorous statistical method identified 
and implemented. We are not aware of any of the existing 
AISNP estimation procedures that has solved this prob- 
lem. We emphasize that users must exercise judgment by 
recognizing the inherent uncertainty and considering the 
differences in sample sizes (displayed with each sample 
name). 

Current panels and examples 

Two different Individual Identification SNP panels 
(IISNPs) provide examples and the ability to calculate 
match probabilities for user-specified genotypes in each 
of many populations that have allele frequencies avail- 
able for all SNPs in the panel. The two panels are (A) 45 
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unlinked IISNPs from the host lab [20] and (B) the 
SNPforlD 52-plex [21]. Similarly, for Ancestry Inference 
SNPs (AISNPs), this pilot implementation provides 
examples and ability to calculate relative likelihoods of 
ancestry from different populations for user-specified 
genotypes. FROG-kb has already implemented three dif- 
ferent AISNP panels. One is a provisional unpublished 
panel of 39 SNPs assembled specifically to test function- 
ality of this web application. We have included an illus- 
tration of the STRUCTURE output for these AISNPs to 
document their validity. A set of 128 AISNPs [22,23] 
and the SNPforlD panel of 34 SNPs [19] are two add- 
itional panels for which examples and calculations are 
available. For Phenotype Informative SNPs (PISNPs) we 
provide a panel of six SNPs for eye color prediction 
(IrisPlex) along with ability to specify an individuals 
genotype and predict eye color from that [24]. 

Limitations 

FROG-kb can only provide information for the popula- 
tions comprehensively tested for the entire set of SNPs. 
For ancestry inference FROG-kb is currently designed 
for individuals whose ancestry is overwhelmingly from 
one population or set of closely related populations. 
Admixed individuals, in the sense of recent ancestors 
from geographically and genetically different popula- 
tions, will not necessarily provide meaningful results. 
For example, one African American genotype gave Ethi- 
opian as the most likely ancestry, understandable be- 
cause the Ethiopians have allele frequencies at many loci 
that are intermediate between those of West African and 
European populations. Other African American geno- 
types have given West African populations as most 
likely. Obviously, different African Americans have dif- 
ferent combinations of the African and European alleles 
for the particular AISNPs in a panel. Thus, results for 
individuals of recent admixed ancestry will be specific to 
the individual and the AISNP panel. 

Currently the allele designations for the genotypes 
listed on the data entry forms are not consistent with 
any single standard. Since most SNPs are unambiguous 
with respect to the strand being called, the user should 
have no problem making the necessary conversion from 
the typing data to the genotype codes on the input screen. 
However, G/C and A/T SNPs need to be specified as to 
the strand being called; the future standard for FROG-kb 
will be that the positive strand (pter to qter, 5' to 3') will be 
the reference even for SNPs in genes that are coded on the 
reverse strand. 

Only a small selection of the SNP and InDel panels 
that have been published is currently available for use in 
FROG-kb. The initial effort in developing this resource 
for the forensic community has necessarily focused on 
the database structure and the website interface. Several 



other panels of markers have already been identified and 
work has begun to curate the data and make the mater- 
ial accessible and useful via FROG-kb. We also expect 
the forensic and research communities to help us iden- 
tify data that should be included but is not. 

Conclusion 

While FROG-kb is a work in progress, the current ver- 
sion of FROG-kb already provides new and useful func- 
tionality. We expect FROG-kb to grow and expand in 
capabilities over the next several months. Indeed, by the 
time this initial announcement reaches official publica- 
tion there will likely be changes. We hope that input 
from the forensic community will help identify those 
functionalities that are most helpful and useful. We ex- 
pect FROG-kb to be a useful reference and resource on 
use of SNP panels in forensics. 
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